Tricolor DAGs for Machine Translation 

Koichi Takeda 









> 

00 

o 
o 
l> 

o 

as 

a 

> 

X 



IBM Research, Tokyo Research Laboratory 

1623-14 Shimotsuruma, Yamato, Kanagawa 242, Japan 

Phone: 81-462-73-4569, 81-462-73-7413 (FAX) 

takeda@trLvnet.ibni. com 



Abstract 

Machine translation (MT) has recently been formulated 
in terms of constraint-based knowledge representation 
and unification theories, but it is becoming more and 
more evident that it is not possible to design a prac- 
tical MT system without an adequate method of han- 
dling mismatches between semantic representations in 
the source and target languages. In this paper, we in- 
troduce the idea of "information-based" MT, which is 
considerably more flexible than interlingual MT or the 
conventional transfer-based MT. 

Introduction 

With the intensive exploration of contemporary the- 
ories on unification grammars |6[ |l^, |l^ and feature 
structures0, 19 in the last decade, the old image of 
machine translation (MT) as a brutal form of natural 
language processing has given way to that of a process 
based on a uniform and reversible architecture [p, 6[ |l| 

The developers of MT systems based on the 
constraint-based formalism found a serious problem in 
"language mismatching," namely, the difference be- 
tween semantic representations in the source and tar- 
get languages.^] Attempts to design a pure interlingual 
MT system were therefore abandoned,^ and the no- 
tion of "semantic transfer" EJ, E2] came into focus as 
a practical solution to the problem of handling the lan- 
guage mismatching. The constraint-based formalismB 
seemed promising as a formal definition of transfer, but 
pure constraints are too rigid to be precisely imposed 
on target-language sentences. 

Some researchers(e.g., Russell p^) introduced the 
concept of defeasible reasoning in order to formalize 
what is missing from a pure constraint-based approach, 

^For example, Yasuharapq] reported there was an over- 
lap of only 10% between his group's English and Japanese 
concept dictionaries, which covered 0.2 million concepts. 

^Even an MT system with a controlled input 
language [[12] does not claim to be a pure interlingual system. 



and control mechanisms for such reasoning have also 
been proposed [|[ ^. With this additional mechanism, 
we can formulate the "transfer" process as a mapping 
from a set of constraints into another set of manda- 
tory and defeasible constraints. This idea leads us fur- 
ther to the concept of "information-based" MT, which 
means that, with an appropriate representation scheme, 
a source sentence can be represented by a set of con- 
straints that it implies and that, given a target sentence, 
the set Cs of constraints can be divided into three dis- 
joint subsets: 

• The subset Co of constraints that is also implied by 
the target sentence 

• The subset C+ of constraints that is not implied by, 
but is consistent with, the translated sentence 

• The subset C_ of constraints that is violated by the 
target sentence 

The target sentence may also imply another set Cnew 
of constraints, none of which is in Cs- That is, the 
set Ct of constraints implied by the target sentences is 
a union of Cq and Cnew 7 while Cs = Co U C+ U C_. 
When Cs = Co = Ct , we have a fully interlingual trans- 
lation of the source sentence. If C+ ^ </>, C_ = (f>, and 
Cnew = 4>, the target sentence is said to be under- 
generated, while it is said to be over- generated when 
C+ = 4>,C- = (j), and Cnew 7^ 0.0 In either case, C_ 
must be empty if a consistent translation is required. 
Thus, the goal of machine translation is to find an opti- 
mal pair of source and target sentences that minimizes 
C+,C-, and Cnew- Intuitively, Cq corresponds to es- 
sential information, and C+ and Cnew can be viewed as 
language-dependent supportive information. C_ might 
be the inconsistency between the assumptions of the 
source- and target-language speakers. 



The notions of completeness and cotierence in LFG||q] 
have been employed by Wedekindpa] to avoid over- and 
under-generation. 



In this paper, we introduce tricolor DAGs to rep- 
resent the above constraints, and discuss how tricolor 
DAGs are used for practical MT systems. In particular, 
we give a generation algorithm that incorporates the 
notion of semantic transfer by gradually approaching 
the optimal target sentence through the use of tricolor 
DAGs, when a fully interlingual translation fails. Tri- 
color DAGs give a graph- algorithmic interpretation of 
the constraints, and the distinctions between the types 
of constraint mentioned above allow us to adjust the 
margin between the current and optimal solution effec- 
tively. 

Tricolor DAGs 

A tricolor DAG (TDAG, for short) is a rooted, directed, 
acyclicy graph with a set of three colors (red, yellow, 
and green) for nodes and directed arcs. It is used to 
represent a feature structure of a source or target sen- 
tence. Each node represents either an atomic value or 
a root of a DAG, and each arc is labeled with a feature 
name. The only difference between the familiar usage 
of DAGs in unification grammars and that of TDAGs 
is that the color of a node or arc represents its degree 
of importance: 

1. Red shows that a node (arc) is essential. 

2. Yellow shows that a node (arc) may be ignored, but 
must not be violated. 

3. Green shows that a node (arc) may be violated. 

For practical reasons, the above distinctions are inter- 
preted as follows: 

1. Red shows that a node (arc) is derived from lexicons 
and grammatical constraints. 

2. Yellow shows that a node (arc) may be inferred from 
a source or a target sentence by using domain knowl- 
edge, common sense, and so on. 

3. Green shows that a node (arc) is defeasibly inferred, 
specified as a default, or hcuristically specified. 

When all the nodes and arcs of TDAGs are red, TDAGs 
are basically the same as the feature structures^ of 



grammar-based translation! 25, M. A TDAG is well 



formed iff the following conditions are satisfied: 
1. The root is a red node. 



''Acyclicity is not crucial to the results in this paper, but 
it significantly simplifies the definition of the tricolor DAGs 
and semantic transfer. 

^We will only consider the semantic portion of the fea- 
ture structure although the theory of tricolor DAGS for rep- 
resenting entire feature structures is an interesting topic. 



2. Each red arc connects two red nodes. 

3. Each red node is reachable from the root through the 
red arcs and red nodes. 

4. Each yellow node is reachable from the root through 
the arcs and nodes that are red and/or yellow. 

5. Each yellow arc connects red and/or yellow nodes. 

6. No two arcs start from the same node, and have the 
same feature name. 

Conditions 1 to 3 require that all the red nodes and 
red arcs between them make a single, connected DAG. 
Condition 4 and 5 state that a defeasible constraint 
must not be used to derive an imposed constraint. In 
the rest of this paper, we will consider only well-formed 
TDAGs. Furthermore, since only the semantic portions 
of TDAGs are used for machine translation, we will not 
discuss syntactic features. 

The suhsumption relationship among the TDAGs is 
defined as the usual subsumption over DAGs, with the 
following extensions. 

• A red node (arc) subsumes only a red node (arc). 

• A yellow node (arc) subsumes a red node (arc) and a 
yellow node (arc). 

• A green node (arc) subsumes a node (arc) with any 
color. 

The unification of TDAGs is similarly defined. The 
colors of unified nodes and arcs are specified as follows: 

• Unification of a red node (arc) with another node 
(arc) makes a red node (arc). 

• Unification of a yellow node (arc) with a yellow or 
green node (arc) makes a yellow node (arc). 

• Unification of two green nodes (arcs) makes a green 
node (arc). 

Since the green nodes and arcs represent defeasible con- 
straints, unification of a green node (either a root of a 
TDAG or an atomic node) with a red or yellow node 
always succeeds, and results in a red or yellow node. 
When two confiicting green nodes are to be unified, the 
result is indefinite, or a single non-atomic green node.pl 
Now, the problem is that a red node/arc in a source 
TDAG (the TDAG for a source sentence) may not al- 
ways be a red node/arc in the target TDAG (the TDAG 
for a target sentence). For example, the functional con- 
trol of the verb "wish" in the English sentence 

''An alternative definition is that one green node has 
precedence over the other llj]. Practically, such a conflicting 
unification should be postponed until no other possibility is 
found. 



making a reasonable guess. Even if such contextual 
processing could be integrated into a logical inference 
system, the obtained information should be defeasible, 
and hence should be represented by green nodes and 
arcs in the TDAGs. Pronoun resolution can be simi- 
larly represented by using green nodes and arcs. 

It is worth looking at the source and target TDAGs 
in the opposite direction. From the Japanese sentence, 

John ha aruku koto wo nozonda 
John +subj walk +nom +obj wished 

we get the source TDAG3 in Figure |l], where functional 
control and number information are missing. With 
the help of contextual processing, we get the target 
TDAG4, which can be used to generate the English 
sentence "John wished to walk." 



Figure 1: Sample TDAGs 



John wished to walk 

may produce the TDAGi in Figure |^, but the red arc 
corresponding to the agent of the *WALK predicate 
may not be preserved in a target TDAG2 jj This means 
that the target sentence alone cannot convey the infor- 
mation that it is John who wished to walk, even if this 
information can be understood from the context. Hence 
the red arc is relaxed into a yellow one, and any target 
TDAG must have an agent of *WALK that is consis- 
tent with *JOHN. This relaxation will help the sentence 
generator in two ways. First, it can prevent generation 
failure (or non-termination in the worst case). Second, 
it retains important information for a choosing correct 
translation of the verb "walk" .^ 

Another example is the problem of identifying num- 
ber and determiner in Japanese-to-English translation. 
This type of information is rarely available from a syn- 
tactic representation of a Japanese noun phrase, and 
a set of heuristic rules |plj is the only known basis for 



^For example, the Japanese counterpart "nozomu" for 
the verb "wish" only takes a sentential complement, and no 
functional control is observed. 

^Whether or not the subject of the verb is human is 
often crucial information for making an appropriate choice 
between the verb's two Japanese counterparts "aruku" and 
"hokousuru" . 



Semantic Transfer 

As illustrated in the previous section, it is often the 
case that we have to solve mismatches between source 
and target TDAGs in order to obtain successful trans- 
lations. Syntactic/semantic transfer has been formu- 
lated by several researchers [|8[ ^ as a means of han- 
dling situations in which fully interlingual translation 
does not work. It is not enough, however, to capture 
only the equivalent relationship between source and tar- 
get semantic representations: this is merely a mapping 
among red nodes and arcs in TDAGs. What is missing 
in the existing formulation is the provision of some mar- 
gin between what is said and what is translated. The 
semantic transfer in our framework is defined as a set of 
successive operations on TDAGs for creating a sequence 
of TDAGs to, ti, . . ., th such that to is a source TDAG 
and tk is a target TDAG that is a successful input to 
the sentence generator. 

A powerful contextual processing and a domain 
knowledge base can be used to infer additional facts 
and constraints, which correspond to the addition of 
yellow nodes and arcs. Default inheritance, proposed 
by Russell et al.||lj], provides an efficient way of ob- 
taining further information necessary for translation, 
which corresponds to the addition of green nodes and 
arcs. A set of well-known heuristic rules, which we will 
describe later in the "Implementation" Section, can also 
be used to add green nodes and arcs. To complete 
the model of semantic transfer, we have to introduce 
a "painter." A painter maps a red node to either a yel- 
low or a green node, a yellow node to a green node, 
and so on. It is used to loosen the constraints imposed 
by the TDAGs. Every application of the painter mono- 
tonically loses some information in a TDAG, and only a 
finite number of applications of the painter are possible 
before the TDAG consists entirely of green nodes and 
arcs except for a red root node. Note that the painter 



never removes a node or an arc from a TDAG, it simply 
weakens the constraints imposed by the nodes and arcs. 
FormaUy, semantic transfer is defined as a sequence 
of the following operations on TDAGs: 

• Addition of a yellow node (and a yellow arc) to a 
given TDAG. The node must be connected to a node 
in the TDAG by a yellow arc. 

• Addition of a yellow arc to a given TDAG. The arc 
must connect two red or yellow nodes in the TDAG. 

• Addition of a green node (and a green arc) to a given 
TDAG. The node must be connected to a node in the 
TDAG by the green arc. 

• Addition of a green arc to a given TDAG. The arc 
can connect two nodes of any color in the TDAG. 

• Replacement of a red node (arc) with a yellow one, 
as long as the well-formedness is preserved. 

• Replacement of a yellow node (arc) with a green one, 
as long as the well-formedness is preserved. 

The first two operations define the logical implications 
(possibly with common sense or domain knowledge) of a 
given TDAG. The next two operations define the defea- 
sible (or heuristic) inference from a given TDAG. The 
last two operations define the painter. The definition 
of the painter specifies that it can only gradually relax 
the constraints. That is, when a red or yellow node (or 
arc) X has other red or yellow nodes that are only con- 
nected through X, X cannot be "painted" until each of 
the connected red and yellow nodes is painted yellow or 
green to maintain the reachability through X. 

In the sentence analysis phase, the first four oper- 
ations can be applied for obtaining a source TDAG 
as a reasonable semantic interpretation of a sentence. 
The application of these operations can be controlled 
by "weighted abduction" [^ , default inheritance, and so 
on. These operations can also be applied at seman- 
tic transfer for augmenting the TDAG with a common 
sense knowledge of the target language. On the other 
hand, these operations are not applied to a TDAG in 
the generation phase, as we will explain in the next 
section. This is because the lexicon and grammatical 
constraints are only applied to determine whether red 
nodes and arcs are exactly derived. If they are not ex- 
actly derived, we will end up with either over- or under- 
generation beyond the permissible margin. Semantic 
transfer is applied to a source TDAG as many timesn 



The iteration is bounded by the number of nodes and 
arcs in the TDAG, although the number of possible se- 
quences of operations could be exponential. 



as necessary until a successful generation is made. Re- 
call the sample sentence in Figure |], where two painter 
calls were made to change two red arcs in TDAGi into 
yellow ones in TDAG2 ■ These are examples of the first 
substitution operation shown above. An addition of a 
green node and a green arc, followed by an addition of 
a green arc, was applied to TDAG3 to obtain TDAG 4,. 
These additions are examples of the third and fourth 
addition operations. 

Sentence Generation Algorithm 

Before describing the generation algorithm, let us look 
at the representation of lexicons and grammars for ma- 
chine translation. A lexical rule is represented by a set 
of equations, which introduce red nodes and arcs into 
a source TDAG.Fj A phrasal rule is similarly defined 
by a set of equations, which also introduce red nodes 
and arcs for describing a syntactic head and its comple- 
ments. 

For example, if we use Shieber's PATR-II|1^ nota- 
tion, the lexical rule for "wished" can be represented as 
follows: 

V -^ wished 
(V cat) = V 
(V form) = past 
(V subj cat) = np 
(V obj cat) = V 
(V obj form) = infinitival 
(V pred) = *WISH 
(V pred agent) — (V subj pred) 
(V pred theme) = (V obj pred) 
(V pred theme agent) = (V subj pred) 

The last four equations are semantic equations. Its 
TDAG representation is shown in Figure 0. It would 
be more practical to further assume that such a lexical 
rule is obtained from a type inference system,[J which 
makes use of a syntactic class hierarchy so that each 
lexical class can inherit general properties of its super- 
classes. Similarly, semantic concepts such as *WISH 
and *WALK should be separately defined in an onto- 
logical hierarchy together with necessary domain knowl- 
edge (e.g., selectional constraints on case fillers and 
part- 0/ relationships. See KBMT-89|||.) A unification 
grammar is used for both analysis and generation. Let 
us assume that we have two unification grammars for 
English and Japanese. Analyzing a sentence yields a 
source TDAG with red nodes and arcs. Semantic inter- 
pretation resolves possible ambiguity and the resulting 

^"For simplicity, we will only consider semantic equations 
to form the TDAGs. 

^^as in Shieber[[15|, Pollard and Sag|13], and Russell et 



Figure 2: TDAG representation of the verb "wished" 
(embedded in the entire feature structure) 



Figure 4: Target TDAG for the sentence "The Boston 
Office called." 



Figure 3: Source TDAG for the sentence "The Boston 
Office called." 



TDAG may include all kinds of nodes and arcs, 
example, the sentenceFj 



For 



The Boston office called 

would give the source TDAG in Figure |3[ By utilizing 
the domain knowledge, the node labeled *PERSON is 
introduced into the TDAG as a real caller of the action 
*CALL, and two arcs representing *PERSON work- for 
*OFFICE a.nd *OFFICE in ^BOSTOiV are abductively 
inferred. 

Our generation algorithm is based on Wedekind's 
DAG traversal algorithmic^ for LFG.[j The algorithm 
runs with an input TDAG by traversing the nodes and 
arcs that were derived from the lexicon and grammar 
rules. The termination conditions are as follows: 

• Every red node and arc in the TDAG was derived. 

• No new red node (arc) is to be introduced into the 
TDAG if there is no corresponding node (arc) of any 
color in the TDAG. That is, the generator can change 
the color of a node (arc) to red, but cannot add a new 
node (arc). 



^^in Hobbs et al.[|] 

^^It would be identical to Wedekind's algorithm if an in- 
put TDAG consisted of only red nodes and arcs. 



• For each set of red paths (i.e., the sequence of red 
arcs) that connects the same pair of nodes, the reen- 
trancy was also derived. 

These conditions are identical to those of Wedekind 
except that yellow (or green) nodes and arcs may or 
may not be derived. For example, the sentence "The 
Boston Office called" in Figure pi can be translated into 
Japanese by the following sequence of semantic transfer 
and sentence generation. 

1. Apply the painter to change the yellow of the definite 
node and the de/arc to green. 

2. Apply the painter to change the yellow of the singular 
node and the num arc to green. The resulting TDAG 
is shown in Figure 0. 

3. Run the sentence generator with an input feature 
structure, which has a root and an arc pred connect- 
ing to the given TDAG. (See the node marked "1" in 
Figure |.) 

4. The generator applies a phrasal rule, say S —f NP VP, 
which derives the suhj arc connecting to the subject 
NP (marked "2"), and the agent arc. 

5. The generator applies a phrasal rule, say NP -^ MOD 
NP,[J which derives the npmod arc to the modifier of 
the NP (marked "3" ) and the mod arc. 

6. Lexical rules are applied and all the semantic nodes, 
*CALL, *OFFICE, and *BOSTON are derived. 



^■^ There are several phrasal rules for deriving this LHS 
NP in Japanese: (1) A noun-noun compound, (2) a noun, 
copula, and a noun, and (3) a noun, postpositional particle, 
and a noun. These three rules roughly correspond to the 
forms (1) Boston OfRce, (2) office of Boston, and (3) office in 
Boston. Inference of the "*OFFICE m *B OS TON " relaiion 
is easiest if rule (3) is used, but the noun-noun compound 
is probably the best translation. 



; ; run the generator with input f-structure 
0> *J-GG-START called with 
((PRED "yobu") (CAT V) (VTYPE V-5DAN-B) 
(SUBCAT TRANS) (ASP-TYPE SHUNKAN) 
(:MDOD ((PRED "@dec"))) 
(AUX ((PRED "@aux") (:TIME ((PRED "Opast"))) 

(: PASSIVE ((PRED "Ominus") )) )) 
(SUBJ ((CAT N) (PRED "jimusho") 
(XADJUNCT ((XCDP "deno") (CAT N) 
(PRED "Boston")))))) 

3> *J-GG-S called ;;<start> ->...-> <S> 
4> *J-GG-XP called with ; ;subj-f iller 
((CASE (*DR* "ha" "ga")) (CAT N) 
(NEG *UNDEFINED*) (PRED "jimusho") 
(XADJUNCT ((COP -) (CAT N) 
(PRED "Boston")))) 
5> *J-GG-NP called ; ;head NP of subj 

10< *GG-N-ROOT returns ; ;np mod 
"Boston" ; ; "Boston" 

9> *J-GG-N called ; ;head np 

10< *GG-N-ROOT returns 
"jimusho" 

7< *9 (<SS> <NP>) returns ; ;mod+NP 
"Boston deno jimusho" 

5< *1 (<NP> <P>) returns ; ;NP+case-marker 
"Boston deno jimusho ha" 
4< *J-GG-XP returns "Boston deno jimusho ha" 
4> *J-GG-S called with ;;VP part 
5> *J-GG-VP called ;;stem + 
6> *J-GG-V called ;;function word chains 
((SUBJ *UNDEFINED*) 
(ADVADJUNCT *UNDEFINED*) 
(PPADJUNCT *UNDEFINED*) 
(:MDOD *UNDEFINED*) 
(AUX ((:TIME ((PRED "Opast"))) 
(: PASSIVE 

((PRED (*0R+ *UNDEFINED* "@minus")))) 
(PRED "@aux"))) 
(CAT V) (TYPE FINAL) (ASP-TYPE SHUNKAN) 
(VTYPE V-5DAN-B) (SUBCAT TRANS) 
(PRED "yobu")) 
7> *J-GG-RENTAI-PAST called ;; past-form 

14< *GG-V-ROOT returns "yo" ;;stem 

6< *J-GG-V returns "yobi mashita" 
5< *J-GG-VP returns "yobi mashita" 
4< *J-GG-S returns "yobi mashita" 
3< *J-GG-S returns 

"Boston deno jimusho ha yobi mashita" 

0< *J-GG-START returns 

"Boston deno jimusho ha yobi mashita" 

Figure 5: Sentence generation from the TDAG for "The 
Boston Office called." 



The annotated sample run of the sentence generator 
is shown in Figure |^. The input TDAG in the sample 
run is embedded in the input feature structure as a set 
of PRED values, but the semantic arcs are not shown 
in the figure. The input feature structure has syntactic 
features that were specified in the lexical rules. The 
feature value *UNDEFINED* is used to show that the 
node has been traversed by the generator. 

The basic property of the generation algorithm is as 
follows: 

Let i be a given TDAG, t,nin be the connected 
subgraph including all the red nodes and arcs in i, 
and tmax be the connected subgraph of t obtained 
by changing all the colors of the nodes and arcs 
to red. Then, any successful generation with the 
derived TDAG tg satisfies the condition that tmm 
subsumes tg, and tg subsumes tmax- 

The proof is immediately obtained from the definition 
of successful generation and the fact that the gener- 
ator never introduces a new node or a new arc into 
an input TDAG. The TDAGs can also be employed 
by the semantic head-driven generation algorithm |17]| 
while retaining the above property. Semantic mono- 
tonicity always holds for a TDAG, since red nodes must 
be connected. It has been shown by Takedaplf that se- 
mantically non-monotonic representations can also be 
handled by introducing a functional semantic class. 

Implementation 

We have been developing a prototype English-to- 
Japanese MT system, called Shalt2[|2^, with a lexicon 
for a computer-manual domain including about 24,000 
lexemes each for English and Japanese, and a general 
lexicon including about 50,000 English words and their 
translations. A sample set of 736 sentences was col- 
lected from the "IBM AS/400 Getting Started" man- 
ual, and was tested with the above semantic transfer 
and generation algorithm.^ The result of the syntactic 
analysis by the English parser is mapped to a TDAG 
using a set of semantic equation^^ obtained from the 
lexicons. We have a very shallow knowledge base for 
the computer domain, and no logical inference system 
was used to derive further constraints from the given 
source sentences. The Japanese grammar is similar to 
the one used in KBMT-89, which is written in pseudo- 
unification ||2^ equations, but we have added several 
new types of equation for handling coordinated struc- 
tures. The Japanese grammar can generate sentences 

^^We used McCord's English parser based on his English 
Slot Grammar |lO|, which covered more than 93% of the 
sentences. 

^^We call such a set of semantic equations mapping rules 
(see Shalt2ra or KBMT-89§). 



from all the successful TDAGs for the sample English 
sentences. 

It turned out that there were a few collections of 
semantic transfer sequences which contributed very 
strongly to the successful generation. These sequences 
include 

• Painting the functional control arcs in yellow. 

• Painting the gaps of relative clauses in yellow. 

• Painting the number and definiteness features in yel- 
low. 

• Painting the passivization feature in green. r| 

Other kinds of semantic transfer are rather idiosyn- 
cratic, and are usually triggered by a particular lexical 
rule. Some of the sample sentences used for the trans- 
lations are as follows]^ 

Make sure you are using the proper edition 
for the level of the product. 

YUUZAA HA SEIHIN NO REBERU NI 
user +subj product +pos level +for 
TEKISETSUNA HAN WO SIYOUSHITE IRU 

proper edition +obj use +pi'og 

KOTO WO TASHIKAMETE KUDASAI 
+nom +obj confirm +imp 

Publications are not stocked at the address 
given below. 

SIRYOU HA IKA DE TEIKYOUSURU 

publication +subj following +loc provide 
ADORESU Nl SUTOKKU SARE MASEN 

address +loc stock +passive +neg 

This publication could contain technical 
inaccuracies or typographical errors . 

KONO SIRYOU HA GIJYUTSUTEKINA 

this publication +subj technical 
HUSEIKAKUSA ARUIHA INSATSUJYOUNO ERAA WO 
inaccuracy or typographical error +obj 

FUKU ME MASHITA 

contain +ability +past 

The overall accuracy of the translated sentences was 
about 63%. The main reason for translation errors was 



^^We decided to include the passivization feature in the 
semantic representation in order to determine the proper 
word ordering in Japanese. 

^* Japanese translation reflects the errors made in English 
analysis. For example, the auxiliary verb "could" is misin- 
terpreted in the last sample sentence. 



the occurrence of errors in lexical and structural dis- 
ambiguation by the syntactic/semantic analyzer. We 
found that the accuracy of semantic transfer and sen- 
tence generation was practically acceptable. 

Though there were few serious errors, some occurred 
when a source TDAG had to be completely "para- 
phrased" into a different TDAG. For example, the sen- 
tence 

Let's get started. 

was very hard to translate into a natural Japanese sen- 
tence. Therefore, a TDAG had to be paraphrased into 
a totally different TDAG, which is another important 
role of semantic transfer. Other serious errors were re- 
lated to the ordering of constituents in the TDAG. It 
might be generally acceptable to assume that the or- 
dering of nodes in a DAG is immaterial. However, the 
different ordering of adjuncts sometimes resulted in a 
misleading translation, as did the ordering of members 
in a coordinated structure. These subtle issues have 
to be taken into account in the framework of semantic 
transfer and sentence generation. 

Conclusions 

In this paper, we have introduced tricolor DAGs to rep- 
resent various degrees of constraint, and defined the 
notions of semantic transfer and sentence generation 
as operations on TDAGs. This approach proved to 
be so practical that nearly all of the source sentences 
that were correctly parsed were translated into readily 
acceptable sentences. Without semantic transfer, the 
translated sentences would include greater numbers of 
incorrectly selected words, or in some cases the genera- 
tor would simply failp^ 

Extension of TDAGs for disjunctive information and 
a set of feature structures must be fully incorporated 
into the framework. Currently only a limited range 
of the cases are implemented. Optimal control of se- 
mantic transfer is still unknown. Integration of the 
constraint-based formalism, defeasible reasoning, and 
practical heuristic rules are also important for achieving 
high-quality translation. The ability to process and rep- 
resent various levels of knowledge in TDAGs by using 
a uniform architecture is desirable, but there appears 
to be some efficient procedural knowledge that is very 
hard to represent declaratively. For example, the neg- 
ative determiner "no" modifying a noun phrase in En- 
glish has to be procedurally transferred into the nega- 
tion of the verb governing the noun phrase in Japanese. 
Translation of "any" , "yet" , "only" , and so on involves 
similar problems. 



The Essential Arguments Algorithm |9[ might be an al- 
ternative method for finding a successful generation path. 



While TDAGs reflect three discrete types of con- 
straints, it is possible to generalize the types into contin- 
uous, numeric values such as potential energyW. This 
approach will provide a considerably more flexible mar- 
gin that deflnes a set of permissible translations, but it 
is not clear whether we can successfully define a numeric 
value for each lexical rule in order to obtain acceptable 
translations. 
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